Uncertain Graph Processing through Representative Instances and Sparsification
نویسندگان
چکیده
Data in several applications can be represented as an uncertain graph, whose edges are labeled with a probability of existence. Currently, most query and mining tasks on uncertain graphs are based on Monte-Carlo sampling, which is rather time consuming for the large uncertain graphs commonly found in practice (e.g., social networks). To overcome the high cost, in this doctoral work we propose two approaches. The first extracts deterministic representative instances that capture structural properties of the uncertain graph. The query and mining tasks can then be efficiently processed using deterministic algorithms on these representatives. The second approach sparsifies the uncertain graph (i.e., reduces the number of its edges) and redistributes its probabilities, minimizing the information loss. Then, Monte-Carlo sampling applied to the reduced graph becomes much more efficient.
منابع مشابه
Uncertain Graph Sparsification
Uncertain graphs are prevalent in several applications including communications systems, biological databases and social networks. The ever increasing size of the underlying data renders both graph storage and query processing extremely expensive. Sparsification has often been used to reduce the size of deterministic graphs by maintaining only the important edges. However, adaptation of determi...
متن کاملSparsification Algorithm for Cut Problems on Semi-streaming Model
The emergence of social networks and other interaction networks have brought to fore the questions of processing massive graphs. The (semi) streaming model, where we assume that the space is (near) linear in the number of vertices (but not necessarily the edges) is an useful and efficient model for processing large graphs. In many of these graphs the numbers of vertices are significantly less t...
متن کاملSimilarity-Aware Spectral Sparsification by Edge Filtering
In recent years, spectral graph sparsification techniques that can compute ultra-sparse graph proxies have been extensively studied for accelerating various numerical and graphrelated applications. Prior nearly-linear-time spectral sparsification methods first extract low-stretch spanning tree from the original graph to form the backbone of the sparsifier, and then recover small portions of spe...
متن کاملON THE MATCHING NUMBER OF AN UNCERTAIN GRAPH
Uncertain graphs are employed to describe graph models with indeterministicinformation that produced by human beings. This paper aims to study themaximum matching problem in uncertain graphs.The number of edges of a maximum matching in a graph is called matching numberof the graph. Due to the existence of uncertain edges, the matching number of an uncertain graph is essentially an uncertain var...
متن کاملAdaptation of Graph-Based Semi-Supervised Methods to Large-Scale Text Data
Graph-based semi-supervised learning methods have shown to be efficient and effective on network data by propagating labels along neighboring nodes. These methods can also be applied to general data by constructing a graph where the nodes are the instances and the edges are weighted by the similarity between feature vectors of instances. However, whereas a natural network is often sparse, a net...
متن کامل